AN INTERESTING NEW MAHONIAN PERMUTATION STATISTIC 

MARK C. WILSON 

Abstract. The standard algorithm for generating a random permutation gives rise 
to an obvious permutation statistic DIS that is readily seen to be Mahonian. We give 
evidence showing that it is not equal to any previously published statistic. Nor does its 
joint distribution with the standard Eulerian statistics des and exc appear to coincide 
with any known Euler-Mahonian pair. 

A general construction of Skandera yields an Eulerian partner eul such that (eul, DIS) 
(^ I is equidistributed with (des, MA J). However eul itself appears not to be a known Euler- 

O^ ■ ian statistic. 

Several ideas for further research on this topic are listed. 
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1. The statistic 



^j ■ 1.1. Random permutations. The standard algorithm [KnuSll 3.4.2, Algorithm P] 

^^ , for uniformly generating a random permutation of [n] := {1, ... ,n} is as follows. Start 

with the identity permutation l = 1 ... n in the symmetric group S„ . There are n steps 
labelled n,n — 1, . . . , 1 (the last step can be omitted, but it makes our notation easier 
to include it here). At step i a random position ji is chosen uniformly from [i] and the 
current element in position ji is swapped with the element at position i. 



Example 1. The permutation 25413 € S5 is formed by choosing J5 = 3,j4 = 1, J3 
^ I 1,J2 = l,ji = 1. Its inverse 41532 is formed by choosing J5 = 2,j4 = 3,J3 = 2,J2 



In terms of multiplication in S„, vr is a product of "transpositions" YliKni'^'Ji)- ^^y 
^^ of these "transpositions" may be the identity permutation. This representation as a 

Q ■ "triangular product" gives a bijection between S^ and the set of sequences (ji, ■ ■ ■ ,jn) 

i^ I that satisfy 1 < ji < i for all i. 

Knuth attributes this algorithm to R. A. Fisher and F. Yates |FY38] , and a com- 
puter implementation was given by Durstenfeld }Dur64j . Recently [Wil09j the present 
k>( \ author and others have studied the distribution of various quantities associated with the 

j_j ■ algorithm. 

1.2. The statistic. For each n > 1, there is a map S„ — ?> S„+i that maps tt to the 
permutation tt^ that fixes n + 1 and agrees on 1, . . . ,n with vr. We let S be the direct 
limit of sets induced by these maps. If we think of each ^ as an inclusion map, as is 
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common, then S is simply the union of ah S„. For our purposes a permutation statistic 
is simply a function T : S ^- N. 

Of course it is always possible to construct a statistic T by for each n making it equal 
to a given statistic T„ on S„. However unless the values of T cohere for different values 
of n this is not useful. We define a statistic on S to be coherent if it satisfies the following 
property. To be coherent, the identity T{-k) = T{7r^) must hold for all n and vr € S„. 

We now define a (coherent) permutation statistic, which we denote by DIS, as follows. 

Definition 2. At step i of the algorithm described above, one symbol moves rightward 
a distance di = i — ji (possibly zero), and one symbol moves leftward the same distance. 
We define DIS(7r) = X^jdj, the total distance moved rightward by all elements. 

There is an alternative interpretation of DIS. The sequence of moves that formed vr 
starting from the identity will take vr"^ to the identity, and the moves are the same as 
selection sort. The algorithm then sorts vr"^ via selection sort. We can think of DIS as 
a measure of the work done by selection sort when comparisons have zero cost. This 
model might be useful in analysing, for example, physical rearrangement of very heavy 
distinct objects. 

In view of the last paragraph it makes sense also to consider the statistic IDIS given 
by IDIS(7r) = DIS(7r~i). f 

Example 3. For our running example 25413, the value of DIS is (1 + 2 + 3 + 4 + 5) — 
(1 + 1 + 1 + 1 + 3) =8, while for IDIS the value is 6. In terms of vr^^ = 41532, the swaps 
used to create vr yield successively 41235, 31245, 21345, 12345. 

Given a permutation for which we do not already know the ji, we can find these easily. 

Example 4. Given vr = 25413 as above, we can read off j^ = 3 from vr. Thus multiplying 
IT on the right by the transposition (35) leads to 23415. We have now reduced to ir = 2341. 
We now read off J4 = 1 and reduce to vr = 231. Continuing in this way we obtain 
h = l,i2 = l,i4 = 1- 

At first sight it may appear that we must search to find the position of symbol i 
at step i, leading to a quadratic time algorithm for the procedure of the last example. 
However this is not the case, provided we compute DIS and IDIS simultaneously, and 
the entire computation can be done in linear time (note that computing vr^^ from vr is a 
linear time operation). Note that, for example, it is still unknown whether the number 
of inversions INV of a permutation can be computed in linear time. 

Example 5. In the running example vr = 25413, vr^-*^ = 41352, we read off j^i^n) = 
3, j5(7r~^) = 2. To multiply vr on the right by the transposition (35) we need not scan all of 
vr, because we know the location of the symbol 5, namely j^{tt^'^) . Thus the multiplication 
takes constant time. We can either multiply vr^"*^ on the left by (35) or on the right by 
(25). Each leads to the same answer, namely 41235, and this is the inverse of the updated 
vr. Continuing in this way we obtain the result of the last example. 

It will be helpful to know the values of DIS on some special permutations. 
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Example 6. We define 

ttq = n{n — 1) ... 1 
TTi = 2 . . . nl 

TTf^ = nl2 . . . (n - 1) 

Note that ttq is created by the algorithm by choosing ji = n + l — i provided n+l — i < i, 
whereupon all later swaps are trivial. Also vri is created by choosing ji = 1 for all i, while 
TT^ is formed by choosing ji = i — 1 for i >2. Thus 

DlS(i) = 

„2 



n 
4 



DIS(7ro) = LnV4j = j ^ 

DIS(7ri) = n(n-l)/2 
DIS(7rf^) = n-l 



if n is even; 
if n is odd. 



The maximum value of DIS on S„ is n{n — l)/2, corresponding uniquely to the n-cycle 
TTi. The minimum value of DIS on S„ is 0, corresponding uniquely to the identity l. 

As a random variable, the restriction DIS^ of DIS to S^ is the sum of DIS„_i and a 
random variable C/„ that is uniform on [0..n — 1]. Thus, iterating this recurrence, we see 
that DIS„ has probability generating function F„(g) := niLi T^- This is the definition 
of a Mahonian statistic on S„. Note that n{n — l)/2 — DIS = ^jjj is also Mahonian by 
the symmetry of the Mahonian distribution. 

2. DIS IS NOT TRIVIALLY EQUAL TO A KNOWN STATISTIC 

Tabulating numerical values makes it clear that DIS is not equal to any of the most 
well-known Mahonian statistics. Table [T] gives the values of DIS and several other 
Mahonian statistics when n = 4 (it is amusing to note that they all coincide on the 
element 2134 - the obvious conjecture that they always coincide on 2134 ... n is in fact 
correct). These statistics are INV,MAJ,DEN,MAD,MAK,HAG. We recah the unified 
definition of these statistics given in [CSZ97J. We first require some partial statistics. 

Definition 7. A descent is an occurrence of the event 7r(i) > Tr{i + 1). The index i is 
the descent bottom and 7r(i) is the corresponding descent top. 

Each vr can be uniquely decomposed into descent blocks (maximal descending sub- 
words). Denote the first and last letter of each block of length at least 2 by c{B),o{B). 
The right embracing number of 7r(i) is the number of that are descent blocks strictly to 
the right of the block containing 7r(i) and for which c{B) > 7r(i) > o{B). The sum of all 
right embracing numbers is denoted by Res(7r). 

Example 8. For ni the descent blocks are all of length 1 except for the last one, nl. The 
right embracing number of each letter 2, . . . , n — 1 is 1 and the right embracing number of 
n and of 1 are each 0. For tt^ there is again a single nontrivial descent block, namely 



Table 1. Values of some permutation statistics for n = 4. 
vr DIS INV MAJ DEN MAD MAK HAG 
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nl, and all right embracing numbers are 0. For ttq there is a single descent block of 
length n and all right embracing numbers are 0. 

Definition 9. An excedance is an occurrence of the event 7r(i) > i. The index i is 
the excedance bottom and 7r(i) is the corresponding excedance top. The sum of all 
descent/excedance tops/bottoms of tt we denote by Dtop(7r),Etop(7r), Dbot(7r),Ebot vr. 
The differences Ddif (vr) and Edif (vr) are given by Ddif = Dtop — Dbot, Edif = Etop — Ebot. 

There is a unique decomposition tt into tte and -/rjv, where t^e is the subsequence 
formed by excedances and tttv the subsequence formed by nonexcedances. For our run- 
ning example vr = 25413, we have tie = 25 and vrjv = 413. For the inverse 41532 we have 
respectively 45 and 132. We define Ine(7r) = INV(7rs) + INV(7rAf). 

For each excedance bottom i we define L(i) to be the number of indices k such that 
k < i and a^ < i; let L be the sum over all such i. 

Example 10. Note that {'KiIe = 23 . . . n and {tti)n = 1- Similarly {tt^ )e = n o-nd 
{tt^ )n = 12 ... (n — 1). Also (vro)^ = n{n — 1) ... t + 1 and {'rTo)N = t . . A, where 
t = [n/2\ . 
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Table 2. Values of partial statistics on special permutations (t = [re/2j) 
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The values of the partial statistics defined above are tabulated in Table [2j 
Proposition 11 ([CSZ97]). We have — ■ 

MAK = Dbot + Res 
MAD = Ddif + Res 

DEN = Ebot + Ine 

INV = Edif + Ine 

HAG = Edif + INV(7r£;) - INV(7rAr) + L 

In addition MAJ is the sum of indices corresponding to descent tops. 



2.1. Trivial bijections. To show that statistics T and T' are different, it suffices to 
find some re and some tt G Sn for which T(tt) ^ T'^n). However it may be the case that 
T and T' agree on S„ for some larger values of re. If both T and T' are coherent, this 
possibility cannot occur. 

Note that DIS and IDIS, along with all statistics from previous literature with which 
we compare them here, are coherent. Thus simply computing values for small n, as in 
the previous section, is usually enough to distinguish the statistics. However we can 
often give a general construction of permutations for which a given pair of statistics 
differs greatly. 

Although DIS does not equal any of the well-known statistics of the previous section, 
is possible a priori that DIS has the form S o g where 5 is a known Mahonian statistic 
and g is a filtered bijection of S (a bijection of S that bijectively takes S„, to S„, for each 
re). 

In this section we consider the so-called "trivial" involutions of Sn (there is a nontrivial 
bijection $ of S„ introduced in [CSZ97J : we give more details in Section [221) These 
involutions are inversion (group-theoretic inverse), reversal (reverse the order of the 
letters) and complementation (subtract each letter from re + 1). Then in the obvious 
notation R and C commute and IR = CI, IC = RI. Thus /, R, C generate a group G 
isomorphic to the dihedral group of order 8. 
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For example we have ^ 

(25413)^ = 41532 

(25413)^ = 31452 

(25413)'^ = 41253 

(25413)^'^ = 25134 

(25413)^^ = 23514 

(25413)^*^ = 35214 

(25413)^^'^ = 41532 



We shall show that DIS is not trivially equivalent to any well-known statistic. In the 
absence of a standardized database of permutation statistics, we define "well-known" to 
mean "mentioned in at least one of the papers [ CSZ971 iBSOOj ". We define S to be the 
set consisting of well-known Mahonian statistics. 

In [BSOOj it is shown how all known "descent-based" Mahonian statistics can be 
written in terms of "Mahonian d-functions" for some d < 4. Each such d-function simply 
computes the numbers of occurrences of a certain generalized permutation pattern of 
length at most d, then sums this process over a finite number of such patterns. In 
particular in Table 1 of the above article, all 14 Mahonian 3-functions (up to trivial 
bijections) are given. In |CSZ97j the images of these statistics under a bijection $ were 
also considered. We consider this bijection in Section] 



Theorem 12. There do not exist S' G S and g G G such that DIS = S o g. 

Proof. Note that tTq = l = tTq* while ttq is a product of [n/2j disjoint transpositions, 
and hence ttq = ttq. Hence the orbit of ttq under G is the set {t, ttq} and this is also 
the orbit of l. The orbit of vri under G is disjoint from that of ttq and i. It consists of 
7ri,7rf\7rf = 7rf = In . . . 2, 7r{"^ = (n - 1) . . . In. 

It follows that ii S o g = DIS for some permutation statistic S and element g & G, 
then S{ttq) must equal zero or DIS(7ro). However it is readily seen by comparing with 
Table [2] that none of the statistics in }CSZ97j satisfy this property. This includes those 
mentioned in passing, such as LAG and SIST. 

Now consider the statistics in ^BSOO^ Table 1], given in terms of permutation pattern 
counts. Any pattern that is not strictly descending does not occur in ttq, so we need 
only count occurrences of ba, cba, cb — a,c — ba. Again, none of these lead to zero or 
DIS(7ro), since the number of occurrences of these four patterns in ttq is respectively 
n - 1, n - 2, (n - l)(n - 2)/2, (n - l)(n - 2)/2. 

Finally we consider Haglund's statistic HAG and a descent-based variant DAG as 
defined in [BSOOJ. The statistic DAG can be dealt with by counting pattern occurrences 
in TTo as above. However it is not as easy to differentiate HAG from DIS by using our 
special permutations. In fact when n is even, HAG and DIS take the same value on vro 
(they coincide with Edit). When n is odd, HAG is smaller than DIS by (n — l)/2. We 
instead use the permutation 7r2 = n2 . . . (n — 1)1 formed from t by a single transposition. 
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Table 3. Joint distributions (exc, DIS) and (exc, IDIS) for n = 5. 



Its orbit under G consists of itself and its reverse l(n — 1) . . . 2re, and DIS takes the values 
n — 1 and \{n — 2)^/4] respectively on these two elements. However, HAG(7r2) = 1. 

/ 

2.2. Euler-Mahonian pairs and nontrivial bijections. In |CSZ97j a bijection <l> of 
S„ was given and it was shown that $ had appeared (somewhat disguised) in sev- 
eral previous papers. The key property of $ is that it takes (des, Dbot, Ddif , Res) 
to (exc,Ebot,Edif,Ine). This then gives access to equidistribution results for Euler- 
Mahonian pairs. The term Euler-Mahonian refers in the literature to a bistatistic 
(e, M) such that e is Eulerian, M is Mahonian, and the joint distribution of (e, M) 
is the same as that of another well-known pair (e',M'). Originally the term was used 
only for (e',M') = (des, MAJ). Other authors, for example [ BSOOl [r)SZ97j allow more 
possibilities for (e',M'), and aim to classify these bistatistics up to equidistribution. 

In [BSOOl Table 2] seven equivalence classes (under equidistribution) of Euler-Mahonian 
pairs (des, T) were given for n = 5 (note that the second matrix, corresponding to MAJ, 
has an error: in the row indexed by des = 2, the entries listed as 14 should be 16). This 
corresponds to 14 Mahonian statistics T. It is easy to see that DIS does not occur in 
this table, because its maximum value occurs on vri and des(7ri) = 1, yet none of the 
seven distributions has a nonzero entry in the (1, 10) position. We can also check easily 
that (exc, DIS) has a different distribution from all the entries in the table. Thus if T' 
is the image of such a T under $, then T' ^ DIS. 

We still need to check HAG. By direct computation we can show readily that 

^(vro) = vTi 
^(vri) = (vri)"^ 



Suppose that T = DIS o$ for some T € S. Then T(7ro) = n(n— 1)/2 and ^'(vri) = n—\. 
Clearly HAG fails this test. If To^ = DIS then r(7ri) = \n^/A\ which again HAG fails. 
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Table 4. Joint distributions (des,DIS) and (des,IDIS) for n = 5. 



3. An Eulerian partner for DIS 

Skandera [Ska02) gave a general procedure for associating to each Mahonian statistic 
M another statistic e that is Eulerian and such that the pair (e, M) is Euler-Mahonian 
(equidistributed with (des,MAJ)). Of course, such an Eulerian statistic may not be 
known or particularly interesting. 

Applying this procedure to DIS yields an Eulerian statistic eul. Concretely, eul(7r) 
is obtained from the numbers di by listing them in order, and counting each time we 
encounter a number larger than the current record (the record being initialized to zero). 
For example, for our running example 25413 we have d = (0, 1, 2, 3, 2) and so eul takes 
the value 3. Also note that eul(7ro) = \{n — l)/2] while eul(t) = and eul(7ri) = n — 1. 

A well-known Eulerian statistic is the number of excedances exc. Now exc agrees with 
eul on ttq and vri. Also, eul and exc are equal when n = 3. Nevertheless, eul is not equal 
to exc in general, nor does it equal des. 

Eulerian statistics in the literature are less commonly found than Mahonian ones. As 
far as I am aware, eul is itself new, but this is based on much less evidence than the 
corresponding claim about DIS. 

4. Further comments 

The current paper gives substantial evidence that the statistic DIS is really new. In 
order to check thoroughly whether a permutation statistic is new to the literature, one 
would ideally check a database of such statistics. I have not found such a database. I 
propose that as a minimum, tables of values for n = 4, along with the joint distribution 
with exc and des for n = 5, be included in all papers dealing with this topic, to allow easy 
comparison. It would then be much easier to show that the entire group F generated by 
G and <I> does not have any element g with T o g = DIS for some known Mahonian T, 
since all such T of which I am aware are consistent. 

It may be desirable to find a "static" description of DIS and IDIS, which have been 
defined "dynamically" . I do not know a systematic way to do this (one possible idea is 
to find linear combinations of the above partial statistics that fit the values for small n). 



A related question is to determine whether DIS can be written as a Mahonian d-function 
for some d. 

The statistic IDIS should extend to words via the selection sort interpretation. Whether 
this statistic is Mahonian on words should be investigated and I intend to do this in fu- 
ture work. 

Note: As I was preparing this article I was made aware of completely independent 
recent work by T. K. Petersen [PetlOj that also discusses the statistic DIS and some 
generalizations. The intersection between the topics of these two papers is small, and 
the reader should consult both articles for a fuller picture. 
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